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We study the relation between serial correlation of financial returns and volatility at intraday level 
for the S&P500 stock index. At daily and weekly level, serial correlation and volatility are known 
to be negatively correlated (LeBaron effect). While confirming that the LeBaron effect holds also 
at intraday level, we go beyond it and, complementing the efficient market hyphotesis (for returns) 
with the heterogenous market hyphotesis (for volatility), we test the impact of unexpected volatility, 
defined as the part of volatility which cannot be forecasted, on the presence of serial correlations in 
the time series. We show that unexpected volatility is instead positively correlated with intraday 
serial correlation. 

Serial correlation of asset prices is one of the most elusive quantity of financial economics. According to the theory 
of efficient markets [H, HI, it should not exist at all, and when it exists it represents an anomaly of financial markets. 

■ Many economists and physicist devoted themselves to the study of stock return predictability [3|, |4j . Historical returns 
should prevent any forecasting technique, even if it has been shown, like in [j| , that the random walk hypothesis holds 
only weakly. 

On the other hand the variance of financial returns on a fixed time interval, which is called volatility, is a highly 
predictable quantity @, 0] , with its probability distribution function showing fat tails [§] . The natural association of 
volatility to financial risk forecast and control makes its analysis paramount in Economics. To some extent, it seems 
L£h ! obvious therefore to link volatility to returns serial correlation. If anything else, the link between volatility and serial 
correlations can reveal basic properties of the price formation mechanism. 

A notable stylized fact on serial correlation is the LeBaron effect [9|, according to which volatility is negatively 
correlated to serial correlation, relation that has been empirically confirmed at daily level on several markets [I0| . We 
exploit, as in (ill. H^|. high-frequency information (that is, intraday data) to test this effect more precisely than in 
previous studies. With respect to [Ullllj], not only we extend considerably our data set, but also our work is set within 
the framework of the Heterogenous Market Hypothesis [l3| , which implies the modification of some basic assumptions 
, on financial market dynamics due to empirical findings. The main feature of this model is the assumption that agents 

■ must be considered heterogenous, namely, they react differently to incoming news. This implies a completely new 
interpretation of the concept of time in the market, as now every agent has its own dealing time and frequency. More 
importantly, the transactions time, in the agent framework, has to be related to short-, medium-, and long-term 
decisions. Thus, volatility will be itself consistently composed by a cascade of several time components. In this work, 
the heterogeneous market hypothesis is the basis of our method of volatility estimation, introduced for the first time 
in [l4j. A notable advantage in modeling volatility in the context of heterougeneous markets is that it allows to 

. t-H \ reconcile the measure with some empirical findings on volatility (e.g., long-range dependence and fat tails), a feature 

■ which simple autoregressive volatility forecasting models lacks. 

The purpose of this paper is then to exploit intraday measures of volatility and serial correlation to investigate the 
LeBaron effect at a deeper level with a very large and liquid dataset. We find that the LeBaron effect holds. However, 
we also refine the finding of LeBaron by showing that volatility can be splitted into two components: a predictable 
one and an unpredictable one. While the first is negatively correlated with serial correlation, the latter is instead 
positively correlated with serial correlation. 

I. METHODS AND DATA DESCRIPTION 

Let us assume to have rj.,. .. ,r n intraday logarithmic returns. To quantify volatility, we construct daily realized 
volatility measures defined as the cumulative sum of squared intraday five-minutes returns [l5j : 

n 

RVt = Y. r h (!) 

where r^ t is the i-th return in day t. Since volatility has been proved to be log-Normal [l6l |. we use the logarithm of 
RVt to obtain Normal distributions. 
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Measuring daily serial correlation is more subtle, and we borrow from [12J using a modified overlapped Variance 
Ratio. Define: 

1 n 

(2) 
(3) 




(4) 
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where 

m = q(n — q + 1) I 1 — — I . (5) 




We define the variance ratio as follows: 

VR{q) = [^J . (6) 

The use of the power transformation f(x) = x@ makes the distribution closer to a Normal one in small samples [l] 
The expression of Eq. ([6|) is asymptotically Normal with mean 1 and defined standard deviation. (3 is given by: 

2 (EK 1)/2 ^(A,)) (Efe 1)/2 ^(A,)) 

> 3 = 1 -^— , , ,2 S (7) 
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where Wk is the Dirichlct kernel: 



1 sin 2 (fcA/2) 
k sin 2 (A/2) 



^(A) = ~^7r^ (8) 



and Aj = 2irj/n. 

Intuitively, the variance ratio expresses the ratio of variances computed at two different frequencies, whose ratio is 
given by q. If there is no serial correlation in the data, VR(q) should be close to one. In presence of positive serial 
correlation, the variance computed at at the higher frequency is lower, and VR(q) < 1. If instead there is negative 
serial correlation, this argument reverts and VR(q) > 1. We use q = 2,3,4,5,6. Higher values of q cannot be used 
without introducing possible distortions in the statistics behavior [18j | . The VR measure has been shown to be correct 
also for heteroskedastic data generating process [H , and it is defined with overlapping observations [H[ . This measure 
is a reliable measure of serial correlation both at daily [B[ and intraday [TJ [II] level. 

The data set we use is one of the most liquid financial asset in the world, that is the S&P500 stock index futures 
from 1993 to 2007, for a total of 4344 days. We have all high-frequency information, but to avoid for microstructure 
effects we use a grid of 84 daily 5-minutes logarithmic returns, interpolated according to the previous-tick scheme (the 
price at time t is the last observed price before t). These choices are the standard in this kind of applications. 

Relation between volatility and serial correlation can simply be tested using a linear regression 

VR{q)t = b + cAogRVt+e t . (9) 

However, the LeBaron effect tests the relation between serial correlation and lagged volatility, which can be tested 
via the regression: 

VR(q)t=b + c \ogRVt+cilogRVt-i+Et. (10) 

If we want to add further lagged values of realized volatility, we cannot ignore the fact that volatility is well 
known to display long-range dependence. One effective way to accommodate for this stylized fact without resorting 
to the estimation burden of a long memory model is the HAR model of Following what can be termed the 

" Heterogeneous Market Hypothesis" of (l9l |20|. l2~il I22I [23| which recognizes the presence of heterogeneity in traders 
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horizon and the asymmetric propagation of volatility cascade from long to short time periods [2J] , the basic idea that 
emerges is that heterogeneous market structure generates an heterogeneous volatility cascade. Hence, [l4[ proposed a 
stochastic additive cascade of three different realized volatility components which explain the long memory observed 
in the volatility as the superimposition of few processes operating on different time scales corresponding to the three 
typical time horizons operating in the financial market: daily, weekly and monthly. This stochastic volatility cascade 
leads to a simple AR-type model in the realized volatility with the feature of considering realized volatilities defined 
over heterogeneous time periods (the HAR model): 

log RV t = p + (3( d ) log RVt-i 

+ l3 {w) \ogRV t ^l + P [m) \ogRV t (m } +m (11) 
where rjt is a zero-mean estimation error and: 

5 22 

log Rv£ai = J E lo s RV *-*> lo s RV t-x = ^ E lo s RV t->> ■ ( 12 ) 
fc=i fe=i 

Although the HAR model doesn't formally belong to the class of long-memory models, it generates apparent power 
laws and long-memory, i.e. it is able to reproduce a memory decay which is indistinguishable from that observed in 
the empirical data. It is now widely used in applications in financial economics [25l l2q ] . We estimate the HAR model 
with ordinary least squares, and use the estimated coefficients 0oJd),(w),(m) define the predictable volatility as: 

tr P ,t = Po + 0(d) l °S RVt-i + 0( W) log RV t ^l + /3 (m) log RV^ (13) 
and unexpected volatility as the residuals of the above regression: 

o~u,t = fit, (14) 

allowing us to estimate the linear model: 

VR(q) t — b + c u a Vit + c p a Utt + e t (15) 

which fully takes into account heterogeneity, long-range dependence and heteroskedasticity of financial market volatil- 
ity. 



II. RESULTS AND DISCUSSION 



We estimate the regression on a rolling window five years long (Figure^ with q = 2,6) as well as on the whole 
sample (Table 1, q = 2, 3, 4, 5, 6). Results in the first row of Figure[2]may look disappointing, since the LeBaron effect 
is not displayed when looking just at the correlation between realized volatility and variance ratio. Moreover, this 
correlation tends to be slightly positive instead of negative, especially in the first part of the sample. 

However, LeBaron evidenced the negative correlation between r\ and r t r t +\. This is indeed what the second row of 
Figure [2] shows: the coefficient of lagged volatility on variance ratio is negative and significant across all the sample. 

Most interestingly, when the LeBaron effect is correctly disentangled in the data, we find that contemporanoues 
volatility is significantly and positively correlated with the variance ratio. Hence, estimation results for equation [10] 
indicate a sharp difference in the relation between serial correlation and volatility: strongly positive for contempora- 
neous volatility and strongly negative for lagged one. Such antithetical behavior of the relation is even more puzzling 
considering the well known stylized fact of volatility to be highly persistent. How could we explain this result? By our 
heterogeneous "rotation" of the regressors, we can rewrite Equation [10] in the form |15| . This provides the separation 
between predictable and unexpected volatility illustrated in Figure [TJ The new specification greatly helps in shedding 
light on this result providing a precise economic interpretation. Hence, as [12| suggested, we can now provide an 
explanation in term of predictable and unexpected volatility: since volatility is known to be predictable by market 
participants, it has a different impact with respect to its unpredictable component. 

The third row of Figure [2] shows indeed that the predictable volatility, defined by means of the HAR model, is 
negatively correlated with the variance ratio (more with higher q) and that the unpredictable volatility is positively 
correlated with the variance ratio (more with higher q). 

The full sample estimates in Table 1 corroborate this finding. We can then rephrase the LeBaron effect as follows: 
serial correlation is negatively correlated with the expected volatility. Moreover, we can conclude that serial correlation 
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is instead positively correlated with unexpected volatility, which is a previously unrecognized stylized fact of financial 
returns. This stylized fact suggests that the usual explanation of the LeBaron effect in terms of feedback trading 
[27I ] is at least incomplete, advocating for a broader theory on the link between volatility and the way information is 
spread to heterogenous market components. 
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FIG. 1: Top: shows the time series estimate of log RVt used in this study (4344 observations), together with the predictable 
volatility estimated by means of the HAR model. Bottom: shows the time series of unexpected volatility estimated as the 
residuals of the HAR model. 
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FIG. 2: On a rolling window of lenght 5 years, shows: (top) the measures of coefficient Co of regression Q; (middle) the measures 
of coefficients Co and ci of regression (jTOJ ; (bottom) the measures of coefficients Co and ci of regression (|f 5p . On the left column, 
the Variance ratio have been computed with q = 2 (5 minutes); on the right column, with q — 6 (25 minutes). Dashed lines 
represents confidence intervals at 95% confidence level. This figure shows that the correlation between serial correlation and 
predictable volatility is negative (LeBaron effect), while the correlation between serial correlation and unexpected volatility is 
positive. 
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TABLE I: On the full sample, estimated regression. Standard errors are in brackets. The adjusted R 2 is indicated by R 2 and 
is expressed in percentage form. 



Regression [9] Regression \W\ Regression [15] 

c R 2 (%)\ c ci R 2 (%)\ c u c p R 2 



-0.242 1.40 
(0.805) 

-0.949 0.44 
(0.933) 

-0.160 -0.01 
(0.978) 

0.325 0.05 
(1.051) 

0.810 0.23 
(1.108) 



9.026 -11.851 3.82 

(1.506) (1.461) 

10.742 -14.950 3.59 

(1.754) (1.656) 

12.838 -16.608 3.67 

(1.823) (1.697) 

14.499 -18.108 3.95 

(1.969) (1.777) 

16.477 -20.018 4.42 

(2.118) (1.902) 



12.707 -6.376 4.85 

(1.702) (0.958) 

13.478 -7.780 3.93 

(1.998) (1.156) 

14.645 -7.080 3.45 

(2.028) (1.210) 

15.860 -6.903 3.45 

(2.154) (1.205) 

17.603 -6.986 3.73 

(2.280) (1.211) 



